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Abstract 

Consider a game where a referee chooses (x,y) according to a publicly known 
distribution Pxy, sends x to Alice, and y to Bob. Without communicating with 
each other, Alice responds with a value a and Bob responds with a value b. Alice 
and Bob jointly win if a publicly known predicate Q(x,y,a,b) holds. 

Let such a game be given and assume that the maximum probability that 
Alice and Bob can win is v < 1. Raz (SIAM J. Comput. 27, 1998) shows that 
if the game is repeated n times in parallel, then the probability that Alice and 



Bob win all games simultaneously is at most t) log M, where s is the maximal 
number of possible responses from Alice and Bob in the initial game, and v < 1 
is a constant depending only on v. 

In this work, we simplify Raz's proof in various ways and thus shorten it 
significantly. Further we study the case where Alice and Bob are not restricted 
to local computations and can use any strategy which does not imply communi- 
cation among them. 

1 Introduction 

The question how much parallel repetition of a game as in the abstract reduces the 
winning probability of the players was motivated by the study of two-prover interactive 
proofs, initiated by Ben-Or et al. [BOGKW88] , It was first conjectured that in a game 
which is repeated n times in parallel, the probability that Alice and Bob win all the 
games simultaneously is at most v n (sec FRS94 ). However, later a counterexample 
to this conjecture was given |For89j . 

Related Work Various papers give upper bounds on the winning probability of a 
game which is repeated n times in pa rallel [CCL92| IFei91| ILS95| IRaz98l IVer94j . How- 
ever, the upper bound given by Raz [Raz98j is the only explicit bound for arbitrary 
distributions Pxy (it is also quantitatively the strongest). Parnafes, Raz, and Wigder- 
son [PRW97J modify Raz's proof to show that the term log(s) can be replaced by a 
parameter which is much smaller for some games. 
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author was at ETH Zurich. 
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Games for which the n-fold parallel repetition decreases the winning probability 
less than from v to v n were also constructed: Fortnow |For89j givesj a game for 
which the maximal winning probability in two repetitions is larger than v 2 (see also 
FL92j), Feige |Fei91j constructs a game where the winning probability in two parallel 
repetitions does not decrease at all, and Feige and Verbitsky |FV02j give, for infinitely 
many s, a game where Q( lo 1 ^^- ) ) repetitions decrease the winning probability from 

at most | to at least | , where s is the number of possible answers Alice and Bob can 
give. This last result shows that in general Raz's bound is close to optimal. 

No-signaling strategies No-signaling strategies are all those strategies which do 
not imply communication. Popescu and Rohrlich PR94] give an example of such a 
strategy: Alice receives a bit x, Bob receives a bit y, and they respond with uniform 
random bits a and b such that a b — x A y. Note that even though we cannot 
implement this strategy with shared randomness and without communication, Alice 
and Bob cannot communicate if they only have black-box access to such functionality. 

The study of no-signaling strategies is motivated by the idea that if Alice and Bob 
share some entangled quantum state, the set of possible strategies they might use 
increases, but stays a subset of the no-signaling strategies (this subset is strict: for 
example the above strategy which achieves a®b = xAy from (x, y) cannot be simulated 
perfectly using quantum mechanics NCOO, Problem 2.3], |Cir80| — the corresponding 
game is called the CHSH-game jCHSH69j ). 

We remark that there are games which can be won with probability 1 given a shared 
quantum state (and thus with a no-signaling strategy), but not using local strategies. 
Those are called "pseudo-telepathy games" (see [BBT05] and the references therein). 

A parallel repetition theorem for the case where Alice and Bob share a quantum 
state and the decision of the referee only depends on the XOR of the binary answers 
of Alice and Bob was recently given by Cleve et al. [CSUU06] . 

Contributions of this paper In this paper we simplify Raz's proof. Most impor- 
tantly, we replace a large part (essentially Section 6) of Raz's paper with the simpler 
Lemma [5] This also allows us to give an explicit bound on the maximal winning prob- 
ability of a game repeated n times in parallel (Raz does not explicitly describe the 
dependence of v on «). 

The use of Lemma |8] also makes the rest of the argument simpler. We shortly 
explain why: The main part of the proof consists of showing that the information the 
players get in the n-fold repetition does not help them to win the subgame in some 
coordinate j, even conditioned on the event that certain other subgames are won. This 
is done in three steps. In two of these steps the information does not help the players 
because they can generate this information themselves with local computation only. 
Lemma [8] shows that this also holds for the third step. This allows us to merge some 
of the steps, which simplifies the overal structure. 

1 For readers not familiar with such counter-examples, a variation of Fortnow's game is reproduced 
in Appendix lAl 
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We also study how much the term log(s) in the exponent in the parallel repetition 
theorem can be reduced. In [PRW97] it is shown that the logarithm of the partition 
number of the accepance predicate can be used instead of log(s). Based on the ideas 
from there, Theorem [IT] gives a bound which might be stronger for some games. 

Finally, we prove a parallel repetition theorem in case Alice and Bob are restricted 
to no-signaling strategies (in both the given game and the parallel repetition of it). 

2 Notation and Basic Facts 

2.1 Probability Distributions 

We use calligraphic letters to denote sets. We denote random variables using capital 
letters, and values with lower case letters. We use superscripts to denote tuples, e.g., 
X n := (Xi, . . .,X n ) and x n := (xi, . . .,x n ). 

If a distribution Pxy over X x y is given, we write Px or Py to denote the marginal 
distribution, e.g., Px(x) '■= Y^y^y^ xy{x,v)- The conditional distribution Py\x=x 1S 
Pv\x= x (y) ■= Pxy{x, V )/Px{x). 

Let Px be a distribution over X and Pyi\X\=x be a conditional distribution over y. 
We define the distribution P XqPy^Xx over X x y as 

(Px PY,\ Xl )(x,y) := Px (x) ■ P Yl \x 1=x (y)- (1) 

For this, it is necessary that Py 1 \x 1 = x is defined for every x G X. We also use 
this notation when PyAXi=x is defined as marginal of a given distribution PjfiYi- 
In this case, we define Py 1 \x 1 =x m an arbitrary way if Pxtix) = 0. This notation 
is used for example in Corollary [9] in the form Px y Ps\x, where it is understood 
as (Px Y Ps|x)(a ; ! V, s) := Px y (x, y)P s\x=x( s )- Note that the conditional distribu- 
tion Ps|x=a; is defined there by the marginal distribution Psx of the given distri- 
bution Psxy- Our notation is not explicit since it does not specify which random 
variables are associated with each other. However, this will always be clear from the 
context. 

For two probability distributions Px and Pjfi over the same set X we define the 
statistical distance 

II Px - P Xl || := ~ J2 I P *o (x) - Px, (x) | . (2) 

x£X 

2.2 Games 

Definition 1. A game © = (Pxy,Q) over XxyxAxBisa distribution Pxy 
over X x y and a predicate Q over X x y x A x B . The value v(<8) of a game is 

w(«) := max Pr [Q(X, Y, h a (X), h b (Y))}, 

h a: hb XY 

where the maximization is over functions h a : X — > A and : y — * B . A strat- 
egy (h a , ht) for a game is a pair of such functions. 
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Sometimes also randomized strategies for Alice and Bob are considered, where h a 
and hb also depend on (the same) shared randomness r chosen according to some 
distribution P^. However, there always exists anrGK such that 

Pr [Q(X, Y, h a (X, R), h b (Y, R))\ = E[Pr [Q(X, Y, h a (X, R), h b (Y, R))}] 

RXY Ft XY 

< Pv[Q(X,Y,h a (X,r),h b (Y,r))], (3) 

XY 

and we see that the definition of the value is robust against such a change. Individual 
(local) randomness can be obtained from shared randomness and is thus a special case 
of the above. 

Definition 2. The n-fold parallel repetition 0™ of a game © = (Pxy,Q) over X x 
y x Ax B is the game over X n x y n x A n x B n which is given by <3 n :— (Px™y™, Q An ) 
where 



Px"Y"(x n , y n ) := \\ Pxy(xi, ?/;), and 

i=l 
n 

\x n ,y n ,a n ,b n ) := /\Q{x tl y t , ai M)- 



If a strategy is given, the distribution Px"Y n A n B n of queries and answers is defined 
in the obvious way. We further define, for all i, the event Wi which occurs if the ith 
subgame is won. 

Definition 3. For a game 25™ and a strategy (h a ,hh) the distribution P x^Y n A n B n 
over X n x y n x A n x B n is given by 



Px~Y4x n ,y n ) ifK{x n ) = a 11 and h b {y n ) = 6" 
otherwise. 



Further, W n is the tuple of events (Wi, . . . , W n ) where Wi : <^=> Q(Xi, Yi, Ai, Bi). 

We prove the following version of the parallel repetition theorem. 

Theorem 4 (Parallel Repetition Theorem). For any game with value v :— v(<3) 
and any integer n: 

v(<sn<(i- {1 ~ v)3 ' 



6000 

The constant 6000 could be improved by a more earful analysis (we will not optimize 
constants which would improve it during the proof). However, we do not know whether 
the 3 in the exponent can be reduced. 

In |PRW97j it is shown that in Raz's proof the term log(|„4| \B\) in the exponent can 
be reduced to the maximum of the logarithm of the partition number of Q(x 7 y, •, •). 
As shown by Beame [Bea06], the argument can be adapted to work with the proof 
given here. We give a slightly different argument in Section [8] which shows how the 
term can be reduced to a quantity which is a lower bound on the logarithm of the 
partition number. 
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3 Proof Sketch 



Fix an arbitrary game ©, its n-fold parallel repetition (5™, and a strategy h a , hf, for (5™. 
With the notation from Definition^ the parallel repetition theorem is simply an upper 
bound on Pr[Wi A • ■ • A W n ]. To get such an upper bound, we show that for arbitrary 
indices i\, . . . ,i m there exists an index j such that 

Pr[W j \W h A---AW im ]<v((&)+e, (4) 

where e depends on m, n, log(|.4||2?|), and PrfW^ A • • • A W im ] (this is Lemma IT5j) . 
From a simple induction gives the parallel repetition theorem, thus we now con- 
centrate on the proof of ^ . 

Locally Computable Embeddings In order to prove (H|) we define the distribution 

Pjf„y„ := Px«r™|w il A-AWi TO (5) 

(i.e., the distribution of the message which the referee sends to Alice and Bob condi- 
tioned on the event that the games i\ to i m are won). 

We show (Lemma H4"]) that for some j the following can be achieved by Alice and 
Bob without communication and using shared randomness only: 

1. Alice, on input x, produces a tuple x n with Xj = x. 

2. Bob, on input y, produces a tuple y n with y~j = y. 

3. Let Py"y" be the resulting joint distribution of the tuples (x n ,y n ), assuming 
that (x,y) is chosen according to Pxy- Then 

|j P _„_„_p^|| < £ . 

We say that (X, Y) can be 1 - e-embedded into [X n ,Y n ) with (Aj, Y,) = (A, Y) by 
local computation. 

If such an embedding is given, we can consider the following strategy for the initial 
game <S: Alice and Bob embed their inputs (A, Y) in (A", Y n ) with (Xj,Yj) = (A, Y), 
and answer with coordinate j of h a (X n ) and hi } (Y n ). This strategy wins with proba- 
bility at least Pr[W^|W / i 1 A ■ • • A Wi m ] — e. Since no strategy for the initial game has 
higher winning probability than v(<S) this implies (j4|). 

We remark that a necessary condition for such an embedding to exist is that 

I|Pxy-P^ ? .||<£, (6) 

and indeed this follows from Lemma [5] for Uj — (Xj , Yj ) (of course this condition is 
not a sufficient one). 
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Constructing an Embedding We now give a more detailled explanation how Alice 
and Bob can embed (X, Y) into (X n ,Y n ) with (Xj,Yj) = (X, Y). For this, given 
values (x, y) distributed according to Pxy, Alice and Bob proceed as follows: 

1. Alice and Bob use shared randomness to produce queries and responses for all 
the won games, i.e., values (x^ , yt t , , ) to (x* m , yi m , dj m , &£ m ) . Here, Alice 
and Bob both produce all these values. 

2. For every index i (fc . . . , i m , j}, Alice and Bob examine a shared random 
bit di. If di = 1 both locally produce Xi, otherwise both locally produce y*. 
Again, Alice and Bob both produce all these values. 

3. Using individual randomness, Alice and Bob locally expand their information 
such that Alice gets x n and Bob y n . 

In steps 1 and 2 we have to take care of two things: first, the values produced 
should be distributed according to the the respective marginal of the distribution 
^ ' A n B n x n Y n \x =xAY =y ( wnere P £ n B n x n Y n ^ s defined analogously to ((SJ)). Second, 
Alice and Bob should produce equal values (otherwise the resulting random vari- 
ables (X , Y ) will not have the correct overall distribution) . 

For step 1 achieving both is simple: it follows from Corollary [6] that Alice and Bob 
can choose the values (x^ , y^ , , ), . . . , (xi m , yi m , ai m , bi m ) independently of (x, y) 
according to Pj^ y A b ---x y a b • Using shared randomness this can be done 
such that both get the same tuple. 

The second step is harder, as in this case the values cannot be chosen independently 
of (xj,yj) anymore^ However, let S be the random variables which Alice and Bob 
produce in this step. It will follow from Corollary [S] that ||PxyPg|jf . — PxysII an< l 
||PxyPg|y-. — PxfsII are both small, and Lemma [8] implies that this is sufficient to 
generate S locally. 

In fact, Corollary and Lemma [5] are strong enough to do steps 1 and 2 at the 
same time, and thus these steps are done simultaneously in the proof of Lemma RBI 

Step 3 will be simpler to implement. Because the players also computed a% t and &, f 
in step 1, they can expand their known values according to the given distributions and 
the resulting distribution will be correct (this follows from Lemma [TUl and a detailed 
explanation is in the proof of Lemma fTi]) . 

4 Conditioned Distributions 

The following lemma is essentially Claim 5.1 in Raz's paper [Raz98j (and we use the 
proof given there). It states that if random variables Ui are chosen independently, 

2 The values di can be chosen independently, but not the values of Xi respective yi. We quickly 
explain why this is impossible in general. Assume that the random variables X and Y contain a shared 
bit B. The game © n and the strategy (h a ,hi>) may be such that Alice and Bob win subgame i\ in 
case -Biffi- • -ffi-B n = 0. Generating the values independently of (x, y) would now produce a distribution 
with statistical distance at least i from the target distribution. Therefore, a bit which is contained 
in both x and y must be considered when generating the values of Xi and yi . 
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then conditioning on an event does not change the individual distributions a lot on 
average. 

Lemma 5. Let P^t := . . . Pjj k be a probability distribution overlA k , W an event. 
Then, 

Pr[W] < 2~ Y '^ lPu ^ w ~ Pu i^ 2 . (7) 

As an example, let Ui be uniform and independent bits and W be the event that 
at least fc(| + e) of these bits are one. Then H-Pu^vp — P[/< II > £ and the lemma states 
that Pr[W] < 2~ k£ , which is a version of Chernoff 's inequality (note that this implies 
that Lemma [5] is almost tight; see, for example, |HR06] V 

Using a j) 2 — ^J2j=i a j one eas ily checks that ([7]) implies 



Ell P ^l^-P^II<^log(p^]) - (8) 
which is the form we use later. 

Proof. For two distributions Ps and Pt over the same set iS, the relative entropy 
D{P S \\P T ) is defined as 




This quantity satisfies D{P S \\P T ) > (II p s - Pt||) 2 (see }CT91( Lemma 12.6.1]). Also, 
if Pxjh — P(/i • ■ ■ Pu k and P V k are distributions over the set U k , then 2j=i -^(Pv, II Ptfj ) < 
D{P V k || P V k) (see Appendix EJ. 
Using the above we get 

E(n p ^- p ^n) ^E^( p 

< £)(P[7-l>|wll P l7*) 

V^n ( fcM / P [/ fc l^(« fc )\ 

=E p ^i^(« fc )io g (-^)-J 

- l0 £(^F]) + E log(Pr[^|[/* = U fe ]) 

< l0gf : rV □ 
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We now give a slight extension of this lemma (this makes it simpler to apply later) . 
First, the Uj are independent given the value of an additional random variable T. 
Second, an arbitrary third random variable V with bounded alphabet size gives side 
information about Uj . Then, choosing Uj without considering the fact that an event W 
happened and ignoring V does not change the distribution of Uj too much on average. 
For the notation in the following corollary we refer to Section 12. 1[ equation (TTJ) and 
the subsequent remarks. 

Corollary 6. Let PTU k v := PtPui \tPu 2 \t ■ ■ ■ Pu k \TPv\TU k be a probability distribu- 
tion over T x U k x V , W be an event. Then, 



Y\Ptu,v\w -^tv\wPu 3 \t\ < ^y lo g(l V *l) +1 °g(^^7])' 

where V* := {v G V\P v \ w (v) > 0}. 

The proof is essentially an application of Jensen's inequality on Lemma [5] 

Proof. Fix a pair (t, v) G T x V and consider the distributions Pjjk\ T — t y =v arL d 
Pu k \T=t- We apply Lemma [5] (in the form given by |8])) on these distributions (with 
the event (V=v) A W) and get 

k k 

2^j^ TU 3 V \ W ~ PTV\wPUj\T — ^ PTV\w(P)V) ■ ^ || Pjj i \T=t,V=v,W ~ PUj\T=i 
3=1 t£T,v£V j=l 



< p Tv\w(t,v)J A; log 



Pr[W A V = v\T = t] 



^/ fel0g ( E P ^ w{t > V) PrlWAV = v\T = t] )> 
V teT, tigv* L 1 1 

(10) 



where the last inequality is Jensen's inequality applied on the function ^/log(-) which 
is concave on [l,oo). We compute 

^ 1 \^ Pr[r = t A V = v\W] 

2^ TV\w{t,v) = - 2^ Pr [WAV =v\T = t] 

Pr[T = t A V = v A W] Pr[T = t] 

^ PrlW] Pr\V — v AT — t AW] 

teT,«6V* 

Pr[r = i] _ |V*| 



Pr[W] Pr[VF] ' 

Inserting this into (|10[) completes the proof. □ 
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5 Embedding by Local Computation 

We next study under what conditions random variables can be embedded into other 
random variables by local computations. 

Definition 7 (Embeddable). For two distributions Px Vb an< ^ ^'x 1 sy 1 t we say that 
(Xq,Yq) is 1 — e -embeddable in (XiS,Y\T) with (Xi,Yi) = (Xq,Yq) if there exists a 
probability measure Pr over a set 1Z and functions fA'-XxlZ^S, fs '■ y X TZ- — ► T , 
such that 

\\Px y„Pf a f b \xy - PjfiYiST|| < e, 
where P F A F B \x=xY=y *s the distribution defined by the random variable (fA(x,R), 

f B (y,R))- 

The following lemma gives a condition under which (X,Y) is embeddable in (XS, 
YS). ft is one of the main contributions of this paper. 

Lemma 8. Let a distribution Psxy be given. If 

\\Psxy-PxyP S \x\\ <ei (11) 

and 

\\Psxy -PxyP S \y\\ <£ 2 , (12) 

then (X, Y) is 1 - 2ei - 2e 2 -embeddabl^ in {XS, YS). 

Even if E\ = e 2 = 0, equations (fTTj) and (|12p do not imply that 5 is independent 
of X and 1". For example, if X and Y contain the same uniform random bit, then S 
can depend on this bit. However, if ei = e 2 = the lemma is obviously true: Alice 
uses shared randomness to choose S according to Ps\x=x (more concretely: Alice 
chooses a uniform random real p £ [0, 1] and uses the smallest element s for which the 
cumulative distribution function X)s'<s Ps\x=x(s') is larger than p). Since Bob has 
the same distribution P ' s\Y=y he will find the same value if he uses the same shared 
randomness. 

In case £\ > and £ 2 > 0, we have to overcome the following problem: P$\Y=y is 
unknown to Alice (since y is unknown to Alice), and analogously P s\x=x is unknown 
to Bob. The solution is to define the function f & : X x 1Z — > S with the following 
process: Alice chooses, using shared randomness, a uniform random element s from 
S and a uniform random real number p e [0,1]. If Ps\x=x( s ) > P she outputs s, 
otherwise Alice repeats the above. The function fs ■ y x 7Z — > S is defined by the 
analogous process given y. It is easy to see that Alice outputs elements according to 
the distribution Ps\x=x, Bob according to Pg\ Y =y We further show that usually the 
output of /a is equal to the output of fs ■ 

3 It is understood that the embedding satisfies (X, Y) = (X,Y), i.e., that the original random 
variables will result if from the resulting (XS, YS) the 5-part is omitted. 
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Proof. Let 7Z := (S x [0,1])°° be the set of infinite sequences over S x [0,1]. For 
a fixed x,y and a sequence r := {(si, pi)}i>o, we define fA(x,r) := Si if i is the 
smallest index for which Ps\x=x( s i) > Pi- Analogously, r) := Sj if j is the 

smallest index with Pgt Y — y (sj) > Pj and_| fAB{x,y,r) := Sfe if fc is the smallest index 
with P s\x=xY=y( s k) > Pk- If no such index exist the respective function is defined in 
an arbitrary way (this happens with probability 0). 

Let PxYF A F B F AB be the joint distribution of (x, y, /a(x, r), f B (y, r), Jab (x, y, r)) 
where (x, y) is chosen according to Pxy and r uniformly from 1Z. We have P p AB \x=xY=y = 
Ps\x=xY=yi Pf a \x=x = Ps\x=x and P FB \ Y = y = Ps\Y= y , since these equalities hold 
conditioned on the event that the respective function accepts in round i, for any fixed i. 

Further, we have Pr[F A = F AB \X = x,Y = y] > 1 - 2\\P FaIX=x - P FabIX = x y= v \\- 
the two values Fa, Fab are equal if pj < mm(Pp A \ x =x( s j)i P F AB \x=xY= y ( s j)) f° r 
the smallest j for which p rj < max.(Pp A \ X — x (sj),Pp AB i x — X Y=y(sj)) is satisfied. This 
happens with probability 

J2 S ™in(PF A \X=x( s j): P F AB \X=xY= y ( s j)) _ 1 ~ I|Pf a |X=3; - P F AB \X=xY=y\\ 
Y,s max ( P F A \X=x(sj),P FAB \X=xY= y {sj)) 1 + \\P Fa \X=x ~ P FAB \X=xY=y\\ 

> 1 — 2||P Fa | X=2 . - P FAB \ X=xY=y \\- 

This yields Pt[Fa = F A b] > 1 — 2ei, and analogously we get Pr[F B — Fab] > 1 — 2£2, 
and thus Pr[F A = F B = F A b] >l-2e 1 - 2e 2 . This implies 

||Pxfss — P xyP f a f b \xy\\ = \\Pxyf ab f ab — P XY F A F B \ > 1 — 2ei - 2e 2 - □ 

In the following corollary, the input distribution is changed slightly. This makes it 
a bit easier to apply later. 

Corollary 9. Let distributions Psxy and P x a Y a be given. If 

\\PsxY-Px Y Ps\x\\<£i (13) 

and 

\\Psxy-Px y Ps\y\\ <£2, (14) 

then (X , Yo) is 1 - 3ei - 2e 2 -embeddabl^ in (XS, YS) with (X, Y) = (X , Y ). 

Proof. From p3[) we get ||Pxy — Px >oll — £ i- One can now find a joint distribution 
PxYX Y with Pr[(X,Y) = (Xo,Yq)] > 1 — E\. The corollary now follows by applying 
fA and f B from Lemma [S] □ 

Random variables S, T, U form a Markov chain, written S <-> T «-> [/ if Pstc/ = 
PtPs|tP[/|t (i- e -i if given T the probability distribution of [/ does not depend on S). 
The following lemma is essentially Lemma 4.1 in Raz's paper. 

4 The use of f A g in order to simplify the analysis was suggested by Anup Rao. 

5 The statement could be made symmetric (i.e., (Xq, Yo) is 1 — 2ei — 2e2 — min(ei, £2)-embeddable). 
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Lemma 10. Let Pxyst be any distribution. If 

S X ^YT 

and 

XS ^Y ^T 
then (X, Y) is l-embeddable in (XS,YT). 

Proof. Using individual (non-shared) randomness, Alice computes S according to Pg\ x =. 
and Bob computes T according to P T \ Y —y Since 

Pstxy = PxyPs\xy^t\sxy = PxyPs\xPt\y (15) 
this gives the correct (global) distribution. □ 

6 Embeddings for Games 

Given a game 25 and its n-fold parallel repetition, we now show that (X, Y) can be 
embedded into (X n ,Y n ), where P Xnyn := Px n Y n \w k+1 A-'AW n - 
We need the following simple fact on statistical distance. 

Fact 11. Let P z a and P z x be distributions over Z. Let S C Z be such that Pr[Zo G 
S] =Y>r[Zx 6 5] = \. Then, 

llPzolZoes-Pz^esll^llPzo-PzJ- 

Also, we need the following statements about Markov chains. 

Claim 12. Let P x q y q P x^y x be a distribution over Xq x y>o x X\ x y% , / : Xq x X\ — > li 
and g : yo x 3^1 — > V fee arbitrary. Then, 

X X 1 «- X f(X ,X 1 )Y 1 g{Y ,Y 1 ) <-> F >i- (16) 

Proof. It is sufficient to show this for all possible values xo S <%o and yi G D^i- 
Let P y x 1 := p Y„x 1 \x Q =x Y 1 = yi = P Yo \x =x P x 1 \y 1=Vi - In this case, ([T6| reduces 
to 

X x ^ f(x ,X 1 )g(Y ,yi)^Y . 
Since X\ and lo are independent this is obvious. □ 
Claim 13. Let Ptuv be a distribution over T x hi x V and an event with 

W ^ TV. 

TTien, /or Pf^y := Ptuv\w we have 
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Proof. 

Pfjjy(t,U,v) = P TlJV \ W {t,U,v) 

= Pu\w( u ) P TV\U=u,w(t, v ) 
= Pu\w(u)PTV\U=u(t,v) 

= Pu\w(u)P T \u=u(t)Pv\u=u(v) 

= Pu\w(u)PT\U=u,w(t)Pv\U=u,w(v)- □ 

Lemma 14. Let a game <S n = {Q n , (Pxr)"), a strategy (h a , h^), and k < n be given. 
Let 

Px„y„ := Px"r™|Wfe + iA-AW„ 

Then, for 1 < j < k, there exists Sj > smc/i £/iai (X, K) is 1 — Sj-embeddable 
in (X n ,Y n ) with {Xj,Yj) = (X,Y) and 

g e ^ ^ 15 ^ ^ - fc ) ^ + log ( p r [^. +1 a ■ ■ ■ X Wj) • (17) 

Proof. As described in Definition [3] we consider the distribution P x n Y n A n B n W n and 
the corresponding random variables. Additionally, we let Di, . . . , D k be uniform and 
independent bits. For 1 < j < k we define 



and 

U 

Also, we set 



Xj if L>j = 
Yj otherwise 



Y j XDj = 
Xj otherwise. 



T:= (X k+u ...,X n ,Y k+1 ,... 1 Y n ,D k ,U ), (18) 
V := (A k+1 ,...,A n ,B k+1 ,...,B n ), (19) 



and define the event W := Wt+i A • • • A W n . 
From Corollary [5] we get 



yj PtI^VIW ~ PTV\wPUj\T < ETot , (20) 



where we set 



£Tot := Vk J (n - k) \og(\A\\B\) + log(p^Fj) (21) 
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(we applied Corollary [6] using |V*| < |V|). 

In (f20]) . we condition on both sides on the event Dj — 0, which is, on both sides, 
a restriction on a subset which has probability 5 . Fact [11] implies 



/] \\PTUiV\w/\(Dj=o) ~ Ptv|wa(d j =o)P;7 j |t 



< 2e To 



(22) 



where we do not need to condition on Dj — in Pjj \t since this is included in the 
given t anyhow; in fact we can now write P x \y instead of Pu-\t- 
For a fixed j, define the random variable 



I'M := (X k+1 ,...,X n ,Y k+1 ,...,Y n , 

Di, . . . , Dj-i,Dj + i, . . . , D k , 

With this notation (|2"2"|) is equivalent to 

fc 

^ 1 1 Pj(\ 3) y< V I WA ( £>< =01 ~ PT<-M'>YjV\WA(Dj=a)PXj\Y : j < 2e T o 

J=l 

But now nothing depends on Dj = anymore, so this also means 

fc 



< 2e To 



We set S := (T^\V) and define the probability distribution 
With this, (US]) becomes 



Psx n Y" : ~ Psx™y™|w- 



(23) 



(24) 



(25) 



(26) 



or, equivalcntly 



Lemma [5] implies 



\\ P sx 3 y 3 ~ P sy 3 P x 3 1 Y 3 

j'=i 



3=1 



<2e To 



<2£ To 



fc 



< ^Totj 



(27) 



(28) 



(29) 
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and thus 



J2\\ p sm- PxYP s\yJ\\ ^ 3£Tot - ( 3 °) 

i=i 

Symmetric reasoning yields 

k 

E| P S^- P ^ P S|^|< 3£ Tot. (31) 
J = l 

From d30j) and ([31]), Corollary [9] implies that (X, Y) is 1 - e r embeddable in (XjS, YjS) 

with (Xj, Yj) = (X, Y) and such that Ylj=i £ j — 15 £ Tot- 
We next show that 

X k ^TV^Y k . (32) 

If the bits D k and the values Xk+i, ■ ■ ■ , X n , Yk+i, ■ ■ ■ , Y n are fixed, this follows imme- 
diately from Claim [T3J Since it holds for all these values it must also hold overall. 
From (13"2"]) we easily get 

X n «-> XjS «-> F"Fj5 

Claim [13] yields 

P <-► «-» Y n YjS (33) 
X n XjS *-* YjS <-> F". (34) 

Above we have seen that (X,Y) is embeddable in (XjS, YjS) with (X,-,Yj) = 
(X,Y). Lemma [TOl together with ([33]) and ([34]) now implies that we can 1-locally 
embed this in (X n XjS,Y n YjS). Since Alice and Bob can then ignore part of the 
constructed information this completes the proof. □ 

Lemma 15. Let a game = (Q, P xy), its n-fold repetition (5™, and a strategy (h a , hb) 
for (S™ be given. Let indices i\, . . . , i m be given. Then, there exists an index i m +i such 
that 



Proof. First, we can assume that the given indices <l <m, are pairwise different 
(otherwise we get a stronger statement). Given this we can even assume that i( — 
n — I + 1 by appropriately redefining the functions (h a , hb)- 
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Define the distribution Pj^„y„ := Px n Y n \w n - m +iA—AW„- Lemma fl4l implies that 
there exists an index j such that (X, Y) is 1— e-embeddable in (X n , Y n ) with (Xj , Yf) = 
(X, Y) and 



15,/ ,/mlog(|.A||B|)+log( 

V n — m \ V 



Pr[W„_ m+ i A • • • A W n 



Consider the following strategy for 0. On input (X, Y) Alice and Bob 1 — e-embed this 
into (X n ,Y n ) with (Xj,Yj) — (X,Y). Since the resulting distribution has statistical 
distance at most e from P^„y„, if they output coordinate j of h a (X n ) and hf,(Y n ) 
they have probability at least Pr[Wj| W n -m+i A • • • A W n ] — e to win the initial game. 
The shared randomness can be eliminated (see the remark after Definition [T|), and thus 

w(«) > Pr[Wj-| W n - m +i A • • • A W n ] — e. □ 



7 Parallel Repetition Theorem 

Proof (of Theorem^. Fix a strategy (h a ,hb) for <S n . Then, repeatedly choose the 
index i m+ \ for which Pr[Wj m+1 [W^ A • • • A W im ] is minimized. We set p := 1 and 
p m := Pr[Wij A • • • A W im ]. Lemma [151 implies 



p m +i <Pm- [v + 15\l—!—Jm\og(\A\\B\) + \og(—) ). CM,) 
We show per induction that 

■1+v 



Prn < ( : 

TO,. 

2700 log(|.4||Z3|) ' 



as long as m < 2700 i2g(|^4 1 [g] ) ■ r ^ nc statement holds for m = and we now make a step 
from to to m + 1. First, we can assume that p m > (I±E) m+1 > l m+1 ^ as otherwise 
the induction step is trivial. In this case, (|36|) yields 



Pm+l < ' W + 15 



m/ — ^ — V^logCI^MSI) + (m + 1) 
V rt — to 



— V675TOlog(|^l||S|)) (37) 



< Prn ' « + 

V n — m 

Since we assume to < 2700 , — roilBTl ( n — TO ) ^his P roves the induction step. 



In total we get for to = — ^ 



2700 log(|^||8|) 1 
n(l — v 
3000 log(|.4||Z3|) 



Pm < — r— ■ ( 38 ) 
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We have 



(i-»r , -, . ti— 'r 

3000 



V\ 3000 / 1 — V 



< 1 - ^ (39) 

6000 ' v 1 

where the last inequality follows from (1 — b) a < 1 — ab which holds for all a E [0, 1], 
b < 1. Since Pr[Wi A • • • A W n ] < p m , © and ([21]) imply the theorem^ □ 



8 Improving the Rate 

Theorem U shows that the n-fold parallel repetition reduces the winning probability 
from v{<£>) to (1 - 9(1 - y(0)) 3 ) o( \ As shown in |PRW97j . the term • \B\ 

in the exponent can be reduced to the the maximum (over x, y) number of (fractional) 
rectangles needed to cover the 1-entries in Q(x,y, •, •). Here, we show that it can be 
reduced to a quantity which is possibly smaller in some cases. 

Definition 16 (Exact Fractional Product Cover). Let Q : A x B — > {0,1} be an 
arbitrary predicate. Two functions f : A X {1, . . . , a} — > [0, 1] and g : B X {1, . . . , a} — ► 
[0, 1] form an exact fractional product cover of size a for Q if for all a, b: 

a. 

Q( a ,b) = ^2f(a,i)g(b,i). 

i=i 

Clearly, any partition by rectangles gives an exact fractional product cover (by 
defmining f(a,i) and g(b,i) as appropriate predicates). We will prove the following 
strengthening of Theorem [U 

Theorem 17. Let © = (PxYtQ) be a game. Let a be such that for all (x,y) there 
exists an exact fractional product cover of size a for Q X:V (a,b) := Q(x,y,a,b). If a > 1 
then 

v ' ~ V 6000 J v ' 



6 The minimal value of the sequence defined by po := 1 an< i Pm+i := Pm(y + 
\J n—m \/ m ^ + l°g(l/P™)) i s indeed ^1 — 0((1 — t)) 3 )^ 7 . The argument in the proof above shows 
that the minimal value can only be lower. On the other hand, the sequence given by p' := 1, 
P'm+l := Pm(y y^n) ' s s trictly smaller than the sequence {Pj}j>o- This sequence does not 
decrease anymore if m > m' := n(l — v) 2 /£, and 

m — 1 rrr / m — 1 j~r^ 

p'm'= EI (« + Vr) =exp E ln ( v + V^ 



(j \tl(v + \J~)\ = exp((4v - 1 + 2v 2 \n(v) 



.:<>,[ j Id ( r - , . — ) j < x] > ( < J ,• - I + 2r~ hi( r) - 3v 2 ) • Jl) * (l - ^ ^ 
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and if a = 1 then 



(1 - v ) 2 \n 



To prove Theorem [IT] we first need a characterization of fractional product covers 
by Markov chains. 

Lemma 18. Let a distribution P abz = PaPbPz\ab be given for which there exists 
functions /(a, z) : A X Z — ► [0, 1] and g(b, z) : B x Z — > [0, 1] which satisfy 

P z\A=aB=b{z) = f(a,z)-g(b,z). (42) 

Then, A <-> Z «-> £? . 

Lemma [18] could be strengthened as follows: if P^|ab is such that A <-> Z <-> i? 
for all distributions PaPb, then P^as is of the form (j42|) for some functions / and g. 
For completeness, we prove this in Appendix [C] 

Lemma [TBI implies the following: if Q : A x B — > {0, 1} has a fractional product 
cover of size a, then there exists a random variable Z over some set Z given by a 
conditional distribution P z\ab with the following properties: 

• For any product distribution P ab = Pa P b we have A <-> Z <-> B 

• |{z € 2|3o, b : Q(a, b) = 1 A P Z | A=aS=b (z) > 0}| < a 

• Q(a,b) can be inferred from z. 

(Note that we do not restrict the alphabet size of Z in case Q(a, b) = 0, which means 
that in this case z can be, for example, (a, &).) 



Proof (of Lemma [T8\) . We get 

P A\B=bZ=z{o) 



ABZ 



(a,b,z) 



P B z{b,z) 
= PA(a)PB(b)f(a,z)g(b, z) 
£„, P B (b)P A (a')f(a',z)g(b,z) 

= PA(a)f(a,z) 
£ a , P A (a')f(a',z) 

Z b > PA(a)PB(b')f(a,z)g(b',z) 
Ea'^PA(a')PB(b')f(a',z)g(b',z) 

_ Paz(ci, z) 

Pz(z) 
= Pa\z=z(o), 

and thus P A Bz(a,b,z) = P z{z)P B \z=z(b)P A \B=bz=z(a) = P z(z)P B \z=z(b)P A\z=z(a), 
which means that A <-> Z <-> £>. □ 
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Given the characterization from Lemma [18] we can now prove Theorem 1171 
Proof (of Theorem\17\). We first show that Lemma [T4l still holds if we replace (fT7|) by 



£j < ^VkJ (n - k) log(a) + log(— 



[W k+1 A • • • A W„] 



(43) 



For this, we define the random variables D k , U k , U" , and T exactly as in the proof 
of Lemma [HI Instead of (H~9l) we now define 



V:= (Z k+1 ,. 



(44) 



where Zi is obtained from (Ai, Bi, Xi,Yi) by a channel that has alphabet size at most a 
in case Wi, which ensures Ai «-> XiYiZi <-> £>i in case Ai and -B^ are independent, and 
for which Wi can be inferred from (Xi,Yi, Zi). The existence of such a random variable 
is ensured by Lemma [TBI and the fact that for every (x, y) there exists a exact fractional 
product cover of size a for Q(x, y, •, •) (the alphabet size of Z in case Q(x, y, a,b) = 
is irrelevant and Z can be defined, for example, as (A, B) in this case). 
From Corollary [5] we now get 



3=1 



< £Tot ) 



where we set 



£Tot 



:= Vk^ (n - k) log(a) + lo g(^j^y 



(45) 



(46) 



For a fixed j we define T^-?) as in the proof of Lemma [T4l and obtain in exactly the 
same way for S := (T^\ V) and 



SX"Y"\W 



the equations 



3 = 1 



< 3e 



Tot 



(47) 



(48) 



and 



E| P sx^- p ^ P s|^ 



< 3e To 



(49) 



Again, Corollary [5] implies that (X, Y) is 1 — Sj-embeddable in (XjS,YjS) with 
(Xj,Yj) = (X, Y) and such that Ylj=i £ j — 15 £ Tot- 
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Again we get 



x k «-> tv «-> r fe , 



(50) 



now using the properties of the Zi. (This is done as follows: clearly, X k «-> T «-> y fe , 
i.e. for a fixed values i for T the X fc and y fe are independent. Now, inductively adding 
Zi will not change this in any step.) Claim fl3l now yields 



X 1 



X 3 S 



Y n YjS 



X n Xj S ^ YjS ^ Y T 



(51) 
(52) 



and Lemma [TU] completes the proof that (|43p can replace (JTTJ) in Lemma Q3J 

From Lemma [14] where (|TT|> is replaced by (|43|) we obtain (|40|) exactly as in the 
proof of Theorem |4j To get (|4Tj) we note first that in this case (|43|) reduces to 



k 

E 

i=i 



< 15V^/log(j 



1 



vPr[W fe+ i A • • • A W, 
Using an analogous definition for p m as previously, we get 

v + 15 



Pm+l < P 



1 



Pm 



(53) 



(54) 



1 < 



(n—m)(l — v) 
400 



Here, we show per induction that p m < (^j iL ) m as long as ? 

To make a step from m to m + 1 we can assume p m > (i^) m+1 , which implies 
Pm > 2~^ 1_w ^' m+1 ^ (since (1 — ^j) 1-0 < 1 — see inequality below), which means 
that 



Pm+l < Pn 

for relevant values of to. If to + 
get for to = "W'^ 



225(m+l)(l-«) 



(55) 



1 < ^'^"oq 1 V) this implies the hypothesis. We thus 



Pm < 



(^r 



Finally, (i±«)(i-«)/i800 
1 - ab for a G [0, 1], 6 < 



(i-u) yi-^/isoo < ^ 



3600 



again using (1 — b) a < 
□ 



7 Note that in case p m < \1 — 0((1 — v) 2 )J equation (1541 ) only implies p m +i < p m (o + 

15V-log(l - G((l -v) 2 ))) = p m (v + 15^/0({1 - v) 2 )) = p m (v + 1 -v) = p m , and thus J54J cannot 
be used to get a significantly stronger version of the theorem. 
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9 No-signaling Strategies 

No-signaling strategies are those where the only restriction on the response of Alice 
and Bob is that they do not imply communication. 

Definition 19 (No-signaling). A pair (h a ,hb) of functions is no-signaling if h a : 
XxyxTZ^Aandh b : XxyxTZ^B satisfy 

PT[h a {x,y,R)}=Pr[h a (x,y',R)} 

R R 

Pr[h b (x,y,R)] = Pr[h b (x , ,y,R)], 

R R 

for all x,x',y,y'. 

Definition 20 (No-signaling value). The no-signaling value v ns (<&) of a game © = 
(PxyiQ) over XxyxAxBis 

v ns (<3) -max Pr [Q(X, Y, h a (X, Y, R),h b (X, Y.R))], 

XY R 

where the maximum is over all no-signaling functions (/i a , h b ). 

Clearly, v(0) < v ns (<8), since any local strategy is a no-signaling strategy. We 
further note that for no-signaling strategies v ns (& 2 ) > (v ns (<3)) 2 is also possible, 
similar to the local case (see Appendix |A"|) . 

We will prove the following Theorem: 

Theorem 21. For any game (5 with no-signaling value v ns :— u„ s (0) and any inte- 
ger n: 

v ns (® n )< (l- (1 ~"" s)2 r. (56) 
v ' - \ 6400 / v ' 

We remark that the proof of this theorem will be much simpler than the proof of 
Theorem 01 

We first show that if Pxyst a distribution which is close to no-signalling (i.e., 
IIPxfs — PxyPs\x II and Pxyt — PxyPs\y) then there exists a no-signalling strategy 
which produces value which are statistically close to S and T from X and Y. 

Lemma 22. Let Pst, Ps' be arbitrary distributions over SxT and S, S and T finite. 
Then, there exists a distribution Pg^ such that 

||Pst-Pst||<||Ps<-P S || (57) 
||P y -P s ,||=0 (58) 
||P ? -Pr||=0. (59) 

Proof. We change Pst gradually to P-^ such that in the end 158]) and (|59"j) hold. 
For this, fix values sq and si with 

Ps(so) < Ps'(so) and P s (si) > P S '(si). (60) 
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Then, as long as ([60]) holds find a value t for which Pst(si, t) > 0. Decrease Pst(si, t) 
by e and increase Pst{sq, t) by e, such that afterwards Psr(si,t) = or ([60]) does not 
hold anymore for sq, si. After a finite number of repetitions (|60[) is not true anymore, 
and we start the process over again with new values for Sq, s\. However, this can also 
only happen a finite number of times, thus the process terminates. 

If (|60|) cannot be satisfied then clearly |58|) holds. We never change P-r(i) for any t 
which implies (|59|) . Finally, (157|) is ensured by the fact that we only decrease || Pg' — Ps || 



and do not change Pst more than P5. □ 
Lemma 23. Let P x y o-nd P xyst be arbitrary distributions. If 

\\PX Y P S \X-PXYS\\ <£l, (61) 

I|Px YoPt|y — PxftII < £2, (62) 

then there exists a conditional distribution Ps'T'\x'=xY>=y with P s>\x'=xY'=y = Ps'\x>=. 
and PT'\x'=xY'=y — PT'\Y'=y such that 

\\Px y P S >t>\xy ~ PxystW < 3ei + 2e 2 . (63) 



Proof. For fixed x, y we define P s T \x=xY=y using Lemma [2"2l with the following prop- 
erties: 

\\P S T \X=xY=y — PsT\X=xY=y\\ < ||Ps|X=a — P ' S\X=xY=y\\ 
\\Ps \X=xY=y - Ps\X=x\\ = 
\\PT \X=xY=y ~ PT\X=xY=y\\ = 0. 

Then, again using Lemma l22l we define P s'T'\x=xY=y such that 

\\Ps'T'\X=xY=y - Ps T \X=xY=y\\ < \\Pt \Y= v ~ P T \X=xY=y\\ 
\\PT'\X=xY=y — PT a \Y=y\\ = 
\\P S'\X=xY=y — P S \X=xY=y\\ = 0. 

We see that for all pairs x, y we have P ' s'\x=xY=y = p s'\x=x an d Pt'\x=xY=v = 

P T'\Y=y 

We further get 

\\Px y Ps'T'\xy — PxystW 

< £i + ||PxyPs'T'|xy — PxystII 

= £1+ ^2 \ P XY(x,y)P SIT '\X=xY=y(s,t) -PxY(x,y)P S T\X=xY=y{s,t)\ 

x,y,s,t 

< £i + ^2 p xY{x,y)(\\P s \x=x - Ps|x=sY=J + ||PT|y=« - p t\x=xY= v \\) 

x,y 

<£i + \\PxyPs\x - Psxy\\ + \\PxyPt\y - Ptxy\\ 

< 3ei -I- 2e 2 - □ 
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We can now prove a non-signaling analogue of Lemma I15I 8 I 

Lemma 24. Let a game = (Q, Pxy), its n-fold repetition & n , and a no-signaling 
strategy [h a , hb) for <S n be given. Let indices i\, . . . , i m be given. Then, there exists an 
index i m +i such that 



Fr[W im+1 \W ix A-.-AWiJ 



<v ns (<5) + W 



'log 



n-mV VPrfWii A • • • A Wi, 



(64) 



Proof. As in the proof of Lemma [15] we assume that ii = n — £ + 1 and we de- 
fine W := W n - m +i A- • • AW„. The no-signaling property of (/i , /i;,) implies Px n Y n A n — 
Px"-Py"\X"Pa"\X" — Pa™X"Py"|X"- Thus, when we apply Corollary [5] on this distri- 
bution (with the event W and the random variables T = (X n ,A n ) and Uj — Yj) we 
get 



Elk 



— ^ Pty,|w ~ Pr|tyPy,-|r 

3=1 

-V ( "- m)1 ° g (prW])- 



Taking appropriate marginals this gives 



51 IPx^^IW-P^A.IwP^I^I < y(n-m)log( — 
Applying Lemma [5] once more and rearranging we get 

< 2 

I ' ' *'J l"J " — 

Symmetrically, we obtain 



[W] 



3=1 



(n-ro) log(p^ 



(65) 



£ |K^I^ - PxrP^ WW r| < 2«/(n - m) log( 



(66) 



3 = 1 



From (|6"B")) . JHH), and Lemma |2"51 we get that there exists a distribution Pa'.b'Ixy which 
can be implemented by no-signaling functions and for which 



J2\\PxyPa>b>\xy -PxjYjAtB^wj < 10 J (n - m) (log( — 



3=1 



[W] 



8 A previous version of the proof of this lemma contained an error, which was first noticed by Oded 
Regev and Ricky Rosen. 
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Thus, if Alice and Bob use the strategy implied by Pa'b'.\xy (which is no-signaling) 

they can win the initial game with probability PrfW^W]— lQ\/l/(n — m)-\/log(l/Pr[W]) 
for some j, which implies the lemma. □ 

Proof (of Theorem \21)) . Fix a no-signaling strategy (h a , hb) for (3. As in the proof of 
TheoremU]we repeatedly select indices i m +\ such that Pr[W im+1 |W il A • • • A Wi m ] is 
minimized. Let p m := Pr[VK^ A ■ ■ • A Wi m ]. Lemma |2~41 implies 

Pm+l <Pm- (v + 10j—!—log(—)). (67) 

From (f67|) we obtain (|56|) in the same way as we obtained (|4Tj) from (|54l) in the proof 
of Theorem [13 □ 
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A Non-triviality 

Local case We quickly reproduce a slight modification^] of Fortnow's example |For89j 
which shows that in general v(& 2 ) > (v(<3)) 2 . The same variation was also considered 
by Feige and Lovasz |FL92| . 

The game we describe is over bits (i.e., all the queries and all the responses are 
bits). We set P X y(0,0) := P x y(0,l) := Pxy(l,0) := |, and define 

Q(x,y,a,b):=((xVa)^(yVb)). (68) 

This can be described in words: Alice and Bob receive a bit, and at least one of these 
bits is 0. If both players receive 0, exactly one player must respond with 1. If one of 
the players receives 1, the other must respond with 0. 

We first show that for this game v = |. Clearly, v > § (e.g., both players always 
answer 0). To show v < | we check all deterministic strategies. If both players reply 
on query 0, this fails in case x = y = (and thus with probability i). If one player, 
w.l.o.g. Alice, answers with 1 the players fail in case x = and y = 1. 

9 Fortnow also lets the referee choose x = y = 1 with some probability, in which case the players 
cannot win the game. 



24 



If this game is repeated twice in parallel, setting (01,02) := [x2,%i), (^1,^2) := 
(2/2,2/1) also wins with probability |. One can check this as follows: for every fixed 
query (xi, yi) answering with (x 2 , 2/2) wins the first subgame with probability §. More- 
over, with this strategy Q(x±, 2/1, Oij b±) = Q(x2, 2/2, 02, b 2 ) which implies the claim. 

No-signaling case We now show that for the above game 

v(<5) =v{® 2 ) = v ns {<&) = v ns (<5 2 ). (69) 

Previously, it was known that quantum strategies do not help Alice and Bob to win 
this game |Wat02j (in both the single instance case and where two parallel instances 
are used). 

To show (|69p it is sufficient to show that that v(<8) = v ns (<8) (since v ns (<5 2 ) < 
v ns (&) and v ns (<£> 2 ) > v(<& 2 ) = v(<5) are already known). There are two ways to see 
that t>((8) — v ns (<B). First, one can notice that the joint probability of Alice's and Bob's 
reply only matters if x = y = 0; i.e., only for one query. In such a case one can always 
get a local strategy which is as good as a given no-signaling strategy. Alternatively, 
let p be the probability that Alice replies on query and q be the probability that 
Bob replies on query 0. In this case, the players win with probability at most p on 
query (x,y) = (0, 1), with probability at most q on query (1,0), and with probability 
at most (1 — p) + (1 — q) on query (0, 0), which gives an overall winnig probability of 
at most |. 



B A Lemma on Relative Entropy 

This following lemma is well known, but we do not know of a standard reference 
containing a proof of it. 

Lemma 25. Let Pjjk — . . . Pjj k and Pyk be distributions over the same set. Then, 

k 

^(PvJp^dMp^)- 

3=1 

Proof. We prove the bipartite case; the general case follows by induction. 
D{Pv,v 2 \\Pu x Pu 2 ) 

P VlV2 Mi, U 2 ) log — — — — - 
, . VP[/i(wi)Pt/ 2 (W2 

D f \ 1 f p Vi(ui) 

2- p ^^ (Ml ' U2)log lp^M 



(ui,«2) 



P^v 2 (m, U2 )log( pM 



(lil,U 2 ) 
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) 



> X?(P Vl ||P^t) + I?(Pv a ||Pl7 a ), 



where the last inequality follows from the log-sum inequality (see |CT911 Theorem 



C Converse of Lemma [18 

In this appendix we show that Lemma [TB] can be strengthened to get an "if and only 
if" condition. 

Lemma 26. Let a conditional distribution P z\ab be given. If, for all product distri- 
butions Pab = Pa^b the Markov condition A <-> Z «-> B is satisfied, then there exists 
functions /(a, z) : A x Z — » [0, 1] and g(b, z):6x2-»[0,l] such that 



Proof. Fix an arbitrary z throughout the proof, and consider arbitrary elements a, a 1 € 
A and b,b' £ B. We set P A (a) = Pa(o') = 5 and P B (6) = P B (b') = \. The Markov 
condition implies 



2.7.1]). 



□ 



Pz\A=aB=b = f{a, z) ■ g(b, z). 



(70) 



P A\Z=zB=b( a ) 



P A\Z=zB=b'{ a ) 



which is equivalent to 



PABz(a,b, z) 



P ABz{a,b' ,z) 



PABz{a, b, z) + Pabz(cl', b, z) 



Pabz(cl, b', z) + PABz(a',b', z) 



or (because of our choice of Pab) 



P Z\A=a,B=b{ z ) 



P Z\A=a,B=b'{z) 



P ' Z\A=a,B=b{z) + Pz\A=a>,B=b(z) 



P Z\A=a.B=b'{z) + P Z \A=a',B=b'{z) ' 



Analogously one gets (by swapping the roles of a and a') 




P Z\A=a,B=b{z) + P Z\A=a> ,B=b(z) P Z\A=a.B=b> {z) + P ' Z\A=a> ,B=V \z) ' 



Together, this implies 



Pz\A=a,B=b{z)Pz\A=a'.B=b'(z) = P Z\A=a,B=b' (z)P Z\A=a> ,B=b(z) ■ 



(71) 
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Fix now z and let /(•, z) and g(-, z) be functions onto [0, 1] which satisfy 



f(a,z)g(b,z) > P z \ A = a ,B=b(z) (72) 

for all (a, b) and for which the number of pairs (a, b) for which /(a, z)g(b, z) > P z\A=aB=b{ z ) 
is minimal (such functions exist since f(a,z) — g{b,z) — 1 satisfy ([72^) 1. We assume 
this number is non-zero and obtain a contradiction. For this, let (di, b\) be a pair for 
which f(ai, z)g(bi, z) > and for which the quotient P z\A=a 1 .B=b 1 { z )lf( a ii z )g(bi, z) < 
1 is minimal. 



We define 



f'(a,z) 



and 



g'(b,z) 



f(a,z) ifa^ax, 

Pzl A = o,B = bi (^) . r 

— ' — r , — r 1 — 11 a = ax 



g(b,z) if^6i, 

P Zl A 7r ,B = b (z) i{b = b 



We note that /' and g' cannot take values larger than 1. For example, f'{a\,z) > 1 
implies P z\A=a 1 .B=b 1 ( z ) > g{bi,z) > f(ai,z)g(bi,z), which contradicts (|72|) . We 
further claim that either the pair (/',<?) or (/, g 1 ) still satisfies (1721 . Otherwise, there 
are values 02 and 62 such that 



f'(ai,z)g(b 2 ,z) = ZlA ° 1 " B \ (b 2 ,z) > P Z \A= ai B=b 2 { z ) 
g(bi,z) 



and 



f{a2, z )g'{b 1 ,z) = f(a 2 ,z) PzlA ai ' B bl ^' > > P z\A= a .,B=b 1 { z )> 



which implies 

Pz\A=a u B=b 1 ( z ) P Z\A=a u B=b 1 { z ) 

g(h, z ) f(ai,z) 
and using (|7T]) 



g(b 2 ,z)f(a 2 ,z) > P 'z\A=a2B=bA z )P 'z\A= ai B=b 2 ( z )y 



P Z\A=a 1 .B=b 1 ( z ) Pz\A=a 2 ,B=b 2 ( z ) 

f(ai,z)g(h,z) g(b 2 ,z)f(a 2 ,z) 

contradicting the way we chose (ai,6i). Thus, either (f',g) or (f,g') still satisfies 
([72)) and since the respective version of ([72]) is satisfied with equality for at least 
one more pair (a, b) (namely for (ai,6i)) than for which (/,<?) satisfies it, we get a 
contradiction. □ 
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